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Abstract 

Biological networks are one of the most studied object in computational biology. Several 
methods have been developed for studying qualitative properties of biological networks. Last 
decade had seen the improvement of molecular techniques that make quantitative analyses 
reachable. One of the major biological modelling goals is therefore to deal with the quantitative 
aspect of biological graphs. We propose a probabilistic model that suits with this quantitative 
aspects. Our model combines graph with several dynamical sources. It emphazises various 
asymptotic statistical properties that might be useful for giving biological insights. 
Keywords: average-case analysis; Biological networks; Markov chains; Dynamical sources. 

1 Introduction 

Biology is concerned with the study of biological functions that change chemical components into 
another. The biological system that provides such transformations produces both biological matter 
and energy. Genetic and biochemical investigations during the last decade have changed our under- 
standing of biological transformation processes. Today, molecular techniques allow the description 
of complex networks of interacting macromolecules that are responsible of theses phenomenological 
transformations. Understanding biological molecular network is therefore one of the major goal of 
the modern biology. Systems biology uses computational approaches in this purpose (see [TU] for 
overview). In this context, our aim is studying the dynamic of molecular interaction networks. 

Until now, precise and quantitative informations on reaction mechanisms have been roughly 
available for most networks of interest. Therefore, based on different formalisms, various studies 
have focused on the qualitative modelling of dynamical networks [7]. Furthermore, because biolog- 
ical knowledge is incomplete, qualitative modelling has appeared as an appropriate framework for 
reasoning on the biological network and refining the graph whenever additional biological knowledge 
becomes available. By now novel methods in biological data acquisition introduce the ability of high 
throughput measurements that efficiently map out a network of interacting macromolecules. It thus 
gives various quantitative insights of biological systems. One of the major biological modelling goals 
is therefore to deal with the quantitative aspect of dynamical networks with a special emphasis on 
the quantitative reasoning that naturally follows the qualitative reasoning. We propose here a novel 
probabilistic model that suits this quantitative aim. We consider the cost of each macromolecular 
interaction for emphasizing the major quantitative pathways among others qualitative. 

Formerly, we study the typical behavior of the cost on a pathway that pass through the biological 
graph. These costs are sums of elementary (fixed) toll costs on the edges taken by paths. This 
measure on a graph G with r vertices can be exactly described by a r x r real matrix whose element 
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is the toll cost of the edge Later, this matrix is denoted by C(G). We aim at studying 

random costs variables, it is thus necessary to ensure that they are not degenerated. A toll cost 
matrix C(G) is said to be non degenerated if for all strictly positive integer n, there exists at least 
two paths of length n in graph G whose measures differ. 

When a path is taken at random over some probabilistic model, the total cost itself is a real 
random variable. A simple probabilistic framework for generating random paths on a graph is to 
weight properly the edges of the graph by some fixed probabilities. This model is a classical Markov 
chain model. In their book [5] , Meyn et al. provides a general review of classical results on paths 
in Markov chains. 

Here, we define a more general probabilistic framework that in some sense extends Markov 
chains. The edges of the graph are weighted by some dynamical sources, a general model presented 
in the context of pattern matching in [3l SJ [5l [11] . 

Dynamical sources describe non-Markovian processes, where the dependency on past history is 
unbounded, and as such, they reach a high level of generality. A probabilistic dynamical source 
is defined by two objects: (i) symbolic mechanism and (ii) density. The mechanism, related to 
symbolic dynamics, associates an infinite word M(x) to a real number x € [0, 1], and generalizes 
numeration systems. Once the mechanism has been fixed, the density on the [0, 1] interval can 
vary. This induces different probabilistic behaviours for sources of words. Later, we establish a 
correspondence between paths and words over an appropriate alphabet. 

Therefore, a dynamical graph is defined by the combination of a graph and several dynamical 
sources. Like this, one dynamical source is associated with each vertex of the original graph. 

In this context, it is proved that the total cost of a path in the dynamical graph follows asymp- 
totically a gaussian law. 

Definition 1. [Asymptotic gaussian law.] Consider a cost R defined on a set TZ and its restriction 
R n to the subset 7Z n of size n. The cost R asymptotically follows a gaussian law as n — > +oo if 
there exist three sequences a n ,b n ,r n , with r n — > 0, for which 
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The sequence r n defines the speed of convergence, denoted also by r[R n ]. The expectation E[R n ] 
and the variance V[i?„] satisfy E[R n ] ~ a„, V[i?„] ~ b n . The triple (E[R n ],V[R n ],r[R n ]) is a 
characteristic triple for the gaussian law of R. 

Main Theorem. Let Q = (G, V) be a nic£\ dynamical graph and C{G) be a non degenerated toll 
cost matrix on graph G. 

The total cost of a path of length n in the dynamical graph Q, denoted by C n (Q), follows an 
asymptotic gaussian law asn-> +oo with a characteristic triple given by r[C n (Q)] = 0(1/ y/n) 

E[C n (g)} =lg-n + i g + 0(j%) and V[C n (5)] =v g -n + v' g + 0{j%) 
The constants 7^ and vg are expressible with the pression A(t) of the operator T(e*) defined in 
\3.2]) . namely jg = A'(0), vg = A"(0), while fig < 1 is any real number strictly larger than the 
subdominant eigenvalue ofT. 

Such a definition leads to formulate various tools. 



1 The word "nice" is defined in Def. 4, Section 4.1, the word "primitive" in Section 4.1 
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2 Various tools 



We propose a probabilistic model for paths that compose a randomized graph. We relate the 
study of basic parameters of these paths with formal generating functions. It defines the dynamical 
sources model and provides major properties for their related generating operators. 

2.1 Probabilistic model and generating functions 

Let G = (V, E) an oriented, strongly connected graph with r vertices V = {1, . . . , r}. A path of 
length n in the graph in a n-uplc W = w\ — > W2 — > . . . — > w n , where Wi € V for all i € {1, . . . , n} 
and (tUj, tUj+i) G £ is an edge for all i G {1, . . . , n — 1}. We denote by W n (G) the subset of V n 
composed by all possible paths of length n in graph G and by yV*(G) the subset of V* composed 
by all possible paths of any length in graph G. 

We focus on graphs that are strongly connected (i.e., for all pairs of vertices (i, j), there exists 
a path that links i to j. This technical request is crucial since it ensures that the transition matrix 
of the graph is primitive and >V n (G) as well. 

Random paths are drawn according to an induced probability on W*(G). In the sequel, we 
denote by pw the probability that a random (infinite) path begins with the (finite) path W (of 
length n). 

Now, a measure on these paths is a function c : W*(G) — > R whose restriction G„ to W n (G) is 
a real random variable. Our aim is to analyze its probabilistic behavior as n — > +oo, when c is a 
measure that can be described by toll cost values on edges (i.e., each edge contributes by a fixed 
value to the total measure). Our main tool is the moment generating function of G„, defined as 

E[exp(tG„)] := ]T pw ■ exp[tc(W)], (2.1) 
wew n (G) 

and the major issue is showing that it behaves as a "quasi-power" . Then, Hwang's quasi-power 
theorem [6] is used to conclude to an asymptotic Gaussian law. 

Theorem 1 (Hwang) Let Y n be a sequence of variables whose moment generating functions sat- 
isfies when n — > +oo E [exp(£F„)] = [exp(nU(t) + V(t))] ■ [1 + 0(W„)], W„ — ► oo, with a uniform 
error term on the complex closed disk D(to) := {|t| < to}, to > 0. Suppose that U(t) and V(t) are 
analytic in D(to) and U(t) satisfies U"(0) ^ 0. Then, Y n follows an asymptotic gaussian law, with 
a characteristic triple given by 

E [Y n ] = U'(0) ■ n + V'(0) + 0(W n ), Y[Y n ] = U"(0) ■ n + V"(0) + 0{W n ), 
r[Y n ] = 0(max(l/Vn,W n )). 

2.2 Bivariate generating functions 

The so-called probability generating function F c {z,u) relative to parameter c is defined as 

F c (z,u)= W'« c(ff) -/', (2.2) 
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where \W\ denotes the length of the path W, the variables z and u respectively mark the length of 
the path and the parameter c(W). Remark that the moment generating function of parameter G n 
is closely related to F c (z,u) via the relation 



where the notation [z™]G(z) denotes the coefficient of z n in G(z). We now express the generating 
function F c (z, u) in a form that simplify the coefficient extraction. 

2.3 Transition matrix and generating functions 

The transition matrix T := (Ti,j) of the graph G (with r vertices) is the r x r matrix which clement 
of index equals 1 iff (i, j) is an edge of G (and otherwise). Thus. T n is a {0, 1} matrix that 
indicates if there exists at most one path in the graph from one state to another. 

Focussing on the paths, it is convenient to deal with a marked version of the transition matrix 
T := {Tij) such that: %j = ejj if T i: j = 1 and %j = otherwise. In the sequel, E = £(G) := 
S E} refers to the set of all possible edges, and S, := {e^j, € E} refers to the edges 
that leave vertex i. Notice that a path in G corresponds to a string over S*. The matrix T plays a 
fundamental role. Indeed, the component of the matrix T" is the set formed by all the paths 
of length n which allow to reach state j from state i. In addition, the component of the matrix 
T* is the set formed by all possible paths that allow to reach state j from state i using an arbitrary 
number of steps. Finally, the set W n (G) of all possible paths of length n is expressed formally by 



where 1 := (1, . . . , 1) is the r length vector of 1. 

We now present the probabilistic model for path generation. This model is based on dynamical 
systems. Probabilities of passing through an edge are "generated" by operators, and the main gen- 
crating function can be generated itself by operators. Furthermore, unions and Cartesian products 
of sets translate into sums and compositions of the associated operators. This allows to define a 
matrix generating operator related to a graph and its associated probabilistic model. 

2.4 Dynamical sources 

We first recall the definition of a dynamical system (of the interval) (See [HI [5] for details). 

Definition 2. A dynamical system T> = (T,S) is defined by four elements: 

(a) a finite alphabet A, 

(b) a topological partition ofX :=]0, 1[ with disjoint open intervals X m , m £ A, 

(c) an encoding mapping a which is constant and equal to m on each I m , 

(d) a shift mapping S whose restriction to T m is a bijection of class C 2 from X m to J m := S(T m ). 
The local inverse of S\x m is denoted by h m . 

Such a dynamical system can be viewed as a "dynamical source" , since, on an input x of X, it outputs 
the word M(x) formed by the sequence of symbols aS^{x), i.e., M{x) := (ax, aSx, aS 2 x, . . .). 
The branches of S k , and also its inverse branches, are then indexed by A k , and, for any w = 
mi . . . TOfc <G A k , the mapping h w := h mi o h m2 o ■ ■ ■ o h mk is a C 2 bijection from J w onto I w . It is 
possible that the word w cannot be produced by the source: this means that J w is empty, and the 



E[exp(tG„)] = [z n ]F c (z,e*) 



(2.3) 



W„(G) = 1 ■ T n ■ *1, W*(G) = 1 ■ T* ■ % 



(2.4) 
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inverse branch h w does not exist. All the words that begin with the same prefix w correspond to 
real numbers x that belong to the same interval T w . 

Such sources may possess a high degree of correlations, due to the geometry of the branches [i.e., the 
respective positions of intervals T m and Jg := S(Ti)\ and also to the shape of branches [See [5] for 
more details]. For instance, classical sources correspond to dynamical systems with affinc branches, 
for which the derivatives are constant. In other words, the probability of emitting a symbol m is 
closely related to the shape of branches. 



2.5 Probabilities and generating operators 

When the interval X is endowed with some density g, this induces a probabilistic model on A N , 
and the probability p w that a word begins with prefix w is the measure of the interval T w . Such a 
probability p w is easily generated by an operator G^j, defined as 

G H [f](x) = \h' w (x)\ foh w (x)l Jw (x), (2.5) 

since one has p w = I g(x)dx = / \h' w (x)\g o h w (x)dx = / G[ w ] [g)(x)dx. (2.6) 
Ji m Jj w Jo 

The operator Gt w ] is called the generating operator of the prefix w. The generating operator L 
relative to a collection C of words is defined as the sum of all the generating operators relative to 
the words of C, namely L := Ylwec an< ^ ^ e g cncra thig operator G of the alphabet A 

G:=E G K ( 2 - 7 ) 

tn£A 

plays a fundamental role here, since it is the density transformer of the dynamical system; it 
describes the evolution of densities on X under iterations of S: if X is a random variable with 
density g 7 then SX has density G[g]. 

For two prefixes w,w', the relation p w . w > = p w p W ' is no longer true when the source has some 
memory, and is replaced by the following composition property 

G[ w , w >] = G[ w /] o G|y], (2.8) 

so that unions and Cartesian products of collections of words translate into sums and compositions 
of the associated generating operators. Remark just that, due to (|2.8p . the generating operator of 
C x M is M o L. 



3 Dynamical graph 

We construct a probabilistic model for paths by combining both information of the graph and 
information of several dynamical systems. This provides the so-called dynamical graph. 

A dynamical graph Q is defined by a pair (G, T>) where G = (V, E) is an oriented strongly 
connected graph with r vertices (V = {1, . . . , r}) and V = (T>\ , . . . , T> r ) is a sequence of r dynamical 
sources. The dynamical source P, corresponds to the vertex i. It is defined by the alphabet 

:= {eij, {hj) £ E} and describes the probability of leaving the vertex i. Indeed, previously we 
suggested that the dynamics of T>i are properly represented by generating operators G i; [ e ], e is any 
symbol/edge of Thus, if at a time, we are in vertex i with some density g that corresponds 
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to all the previous edges traversed by the path before i, the probability of taking the edge is 
given by 

Pj\*-9 = / G i.[e, 3 \[g](t)dt. 

Jo 

Remark. When all the dynamical systems T>i are simple Bernoulli sources, the dynamical graph 
is a Markov chain (of order 1). In this case, if there exists an edge from vertex i to vertex j, the 
probability of taking the edge is the conditional probability pju. Thus, dynamical graphs 

extend the Markov process model by introducing a high level of correlations between the different 
edges taken by the path. 

Here, we present two distinct tools dedicated to dynamical graphs. The first is a matrix gener- 
ating operator. The second is the (flat) generating operator of a (mixed) dynamical source closely 
related to the dynamical graph. These two objects are conjugated and share the same spectrum. 

3.1 Matrix generating operator 

We transform the transition matrix of a graph into a matrix generating operator that combines 
both information from the graph and the different dynamical sources of vertices. We associate to 
each element Tj i of the marked matrix T, its generating operator T^j 

T,j := Gj [ ei j ] (3.1) 

T is a matrix generating operator which is related to *T, due to (|2.8|) . 

In order to represent properly the generating function of the parameter c, when c is a toll 
based measure, we introduce some "perturbations" in the matrix operator. The perturbed matrix 
operator T(u) is a matrix operator whose elements are 

T i)3 -(u) —u^Tij, (3.2) 

where Cj j is the toll cost of edge («, j). Obviously, T(l) corresponds to the unperturbed version of 
the matrix operator. Thanks to (|2.4[) and (|2.6p . we express F c (z,u) by means of the perturbed 
matrix operator T(u), 

F c (z,u)= f 1- (I-zT^^-^g^dx, 
Jo 

where g is an initial given density. The coefficient of z n in F c (z,u) corresponds to the n-th power 
of the matrix operator, consequently, the moment generating function of the cost C n satisfies 

E [exp(iC n )] = f 1 • T(e') n • ^[g^dx. (3.3) 
Jo 

3.2 Mixed source 

We build a source Sg that combines both a transition matrix T of a dynamical graph Q, and 
the original sources T>i, . . . ,V r . The set of vertices of the underlying graph G is V := {1, . . . , r} 
and the transition matrix T of G has order r. For all i, the source T>i is defined by an interval 
2^ = [0, 1], an alphabet Si, a topological partition (Im )mes, an d a shift which local inverse 
h® := (S^)- 1 maps J® :=]c#,d&>[ on 2$ :=]a$,6$[. 
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The source Sg is denned with the interval Ig = [0,r], the alphabet S := 1J[ =1 a topological 
partition (X m ) mg s and a shift function Sg that maps Ig on Tg. Each local inverse /i m maps J m 
on I m . More precisely, if m = etj € 

I m = I« + i - 1 :=]oW + i - 1, + i - i[, J m = + j- _ i := ] Cm + j - 1, d m + i - 1[, 

and h m (x) = h m (x — j — 1) + £— 1. The density transformer © of the source 5g defined, as in (|2.7p . 

^ <S[/](s) := E -f°hm(x) ■ l Jm (x), (3.4) 

is conjugated to the matrix operator T defined in (|3.1[) via a mapping W [namely © = oTof] 
which associates to g (defined on X) the vector * [gi, . . . , g r ] where each gi is defined on 2"M by 

4 Probabilistic behavior of toll based measures 

For studying the parameter of interest, the major issue is to prove that both graph and dynamical 
systems possess good properties. It is the same for the source Sg associated to the dynamical graph 
Q = (G,T>). As mentioned in [3], we consider dynamical automatons and extends the results to 
dynamical graphs. 

4.1 Nice sources and convenient sources. 

Under general hypotheses, and on a convenient functional space, the density transformer admits 
A = 1 as an eigenvalue of largest modulus. Nevertheless, this is not a unique dominant eigenvalue 
isolated from the remainder of the spectrum. 

Definition 3. A dynamical source is said to be decomposable if the density transformer G [defined 
in {2.7]) ] has a unique dominant eigenvalue (equal to 1) separated from the remainder of the spectrum 
by a spectral gap, i.e., p := sup{|A| ; A G Sp G, A =/= 1} < 1, when acting on a convenient Banach 
space T . 

Remarks. The terminology considers the dominant eigenfunction ip which is an invariant function 
for G. Under the normalization condition J Q ip(t)dt = 1, this last object is unique too, and it 
is also the (unique) stationary density. Due to the existence of the spectral gap, the operator G 
decomposes into two parts, namely G = AP + N, where P is the projection of G onto the dominant 
cigcnspacc generated by <p, and N, relative to the remainder of the spectrum, has a spectral radius 
equal to p, which is strictly less than 1. The operator N describes the correlations of the source. A 
decomposable dynamical source is ergodic and mixing with an exponential rate equal to p. 

Most of the classical sources -memoryless sources, or primitive Markov chains- are easily proven 
to be decomposable. We present sufficient conditions under which a general dynamical source is 
proven to be decomposable, together with all its associated mixed sources Sg [the proofs are omitted 
here] . 

Definition 4. A dynamical source (on a finite alphabet) is said to be "nice" if it satisfies the two 
conditions 

(i) [Expansiveness] There exist two constants C, D with D > 1 for which one has, for any m E S, 
for any x G T m , D < \S'(x)] < C. 
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(ii) [Topologically mixing] For any pair of two nonempty open sets (V, W), there exists no > 1 such 
that S~ n V nW ^ for all n > n . 

Proposition 1. A nice dynamical system is decomposable, with respect to the space BV(I) of 
functions with bounded variation, endowed with the norm ||/|| := sup|/| + V(f) [Here, V(f) is 
the total variation of f on X]. 

We consider now the mixed source Sg . Note here that a transition matrix T is primitive if there ex- 
ists a power of the matrix T whose coefficients arc never the empty language. A strongly connected 
graph produces a matrix T, primitive if and only if the geds of the lengths of its cycles equals 1. If 
it is not primitive, the gcd d of its cycle lengths is called the period, and T d is primitive. 

Proposition 2. Let Q = (G, V) be a dynamical graph. IfT> is a sequence of nice dynamical sources 
and the transition matrix of G is primitive, the mixed source Sg is nice too. 

4.2 Proof of the main theorem. 

The main theorem is then a consequence of all the previous facts. Indeed, thanks to Propositions 2 
and 3 the density transformer © of the mixed source Sg has dominant spectral properties, and by 
conjugation and perturbation theory, this transmits to the quasi-inverses of marked operator T(it), 
when u remains in a neighbourhood of 1. Finally, T(u) admits the decomposition 

T(u) n = A(«) n P + N(u), 

in a complex neighbourhood of u = 1. Then, with (|3 . 3[> . the moment generating function of the 
total cost C n behaves as an approximate n-th power. We conclude by using Hwang's quasi-power 
Theorem [6] (See 2.1). 

5 Examples of toll based measure schemes 

We present here several measure schemes. They all consist in a sum of elementary (fixed) toll costs 
on the edges taken by the path. Thus, a measure on a graph G with r vertices can be exactly 
described by a r x r real matrix whose element is the toll cost of the edge 

5.1 Counters on edges 

Here, we want to have access to statistics on the number of times a given edge is taken. It is surely 
the simplest measuring scheme. Nevertheless, it is quite important since it is the basis of all the 
next measure schemes. It consists in marking a single transition (i,j) by a cost equal to 1. Thus 
the toll cost matrix is a matrix of zeros except at the position 

5.2 Counters on vertices 

In order to count the number of times we reach a given vertex, it is sufficient to consider all the 
edges that point to this vertex. More formerly, the number of times a given vertex k is reached is 
obtained by using a toll cost matrix such that c^k = 1 if (i, k) G E and equals otherwise. 

From an applicative point of view, it also proves useful to have some real toll costs for the edges 
that point to a given vertex. This allows to reflect for instance the stoichiometry of a reaction 
(represented by an edge) between products (represented by the vertices). 
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5.3 Counters on pathways 

The last measure scheme consists in counting particular pathways (here, a pathway is a possible 
(finite) succession of several edges in the graph that does not contain any cycle). This kind of 
statistic is useful to compare different pathways that link the same vertices in a graph. 

This measure is done by duplicating all the vertices along the pathway. This new graph is called 
G' . The copy is used to count the pathway of interest (by counting its last transition) which is 
removed from the original graph (by removing its first transition). Finally, a copy of appropriate 
edges (all the edges that leave a vertex of the path) ensures that two graphs G and G' share the 
same sets of paths (i.e., W„(G') = W n (G) for all n and W*(G') = W*(G)). 

More formerly, let P = io — > i\ i k be a pathway. We create a new graph G' = (V, E'), 

where V = V U . . . ,i' k } (here, i'. stands for the copy of vertex ij), and E' expresses as E' = 
{E\{(i ,i 1 )})UC 1 UC 2 , where 

C 1 = {(i ,i[}U{(i' j ,i' j+1 ), i = l...fc-l}, 

corresponds to the path and 
fe-i 

C2 = |J U{^,£), {i h t) e £ and £ ^ i j+1 } U {(i' k ,£), (i k> £) e E}, 
3=1 

corresponds to the edges needed in order to keep the same sets of paths. 

Now, for counting the number of times a given pathway C is taken, it is sufficient to count the 
number of times transition (i' k _ 1 ,i' k ) is taken. 

Consider the following example in which we want to count pathway C = 1 — ^ 2 — > 3. 
Original graph G Modified graph G' 




6 Numerical experiments 

Here we give two different ways of using the results of this paper. 

The first use is when all the parameter of the model (namely its underlying graph and the 
probabilistic model of each transition) are fully determined. The model is then used as a simula- 
tor and provides some theoretical evidences that may be confirmed later (typically by biological 
experiments for biological networks). 

The second application is a help in the modelling process. Here, the graph is known entirely but 
the probabilistic model is just partially known. By adding some experimental observed knowledge 
(an evaluation of a possible measure), it is possible to get some information on the unknown. 

We provide some results when the probabilistic model is the simplest one, i.e., the probability 
of taking one edge is fixed. We will consider a matrix of real numbers in the place of a matrix of 
operators. 
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6.1 Simulation 



Consider the following graph, transition matrix operator T and cost matrix C (that marks the edge 
(1,2) 




/OA 0.2 0.4\ /0 1 0\ 

T = 0.7 0.2 0.1 C= 

\0.2 0.7 0.1/ \0 0/ 



The dominant eigenvalue X(u) of the marked matrix operator T(u) equals 

_ K(u) 2 / 3 + 208 + 168tt + UKju) 1 / 3 
{U) ~ 60A'(u)V3 > 

where K{u) = 20276 + 2448m + 12\/2792481 + 537960u - 80688u 2 - 32928u 3 . Thus A'(l) = 0.0896 
which indicates that the edge (1,2) is taken approximately 8.9% of times in long paths in the graph. 



6.2 Modelling 

Consider now the following graph, with a transition matrix operator T partially known 

n 




Nevertheless, it is observed that edge (1,3) is taken in average between 30% and 40% of times 
while edge (3, 1) is taken in average between 20% and 30% of times. The question is "does it give 
any information for x, y and z ?" . Obviously, yes. This problem can be seen as a constraint problem 
with 3 variables over the continuous domains D x = D y = D z =]0, 1[ with some constraints that 
are inequalities based on the observations. Eigenvalues A(i j 3)(m) and \( 3 ^(u) associated to the 
observations express symbolically with x, y and z. One has the constraints 

0.3<A' (li3) (l)<0.4, 0.2<A' (3il) (l)<0.3. 

We discretize the three domains and keep only 100 regularly spaced values and then operate an 
exhaustive determination of all possible graphs (among the 100 3 possible ones) that satisfies the 
first constraint, the second constraint and both the two constraints. This is represented by the 
following 3D pictures. 
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First constraint Second constraint Both 

The graphs of interest correspond to the common part, a small part of the entire cube [0, l] 3 . 



7 Conclusion and perspectives 

Our study illustrates toll based measure schemes in a large class of probabilistic model of graph. 
It shows (i) wide generality on both measure and probabilistic model and (ii) entails theoretical 
results on graphs that might be useful in various applications with a special emphasis on biological 
networks application. Our quantitative analysis characterizes the major impact of other measure 
scheme, such those related to the waiting time before taking a given transition. Note here that 
they surely do not obey asymptotically to a Gaussian law. Behind these theoretical items, our 
model rises various perspectives. One of them considers a multivariate framework where the study 
concerns the asymptotic behavior of a vector of different measures. In this context, and for a vector 
of toll based measure, the work of Bender, et al. [1] might be particularly useful. Finally, for a 
modeling purpose, the determination of n unknown parameters of the probabilistic model consist in 
finding an appropriate part of the hypercube [0, l] n . Here shows the importance to design efficient 
methods that find good approximations of this part. The local search community develops several 
methods such as tabu search or genetic algorithms that are still to be considered for investigating 
graph properties. 
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